2025-07-02 |
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks |
Rahul Ramachandran et al. |
2507.01955v1 |
2025-07-02 |
null |
2025-07-02 |
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model |
Yukang Cao et al. |
2507.01953v1 |
2025-07-02 |
null |
2025-07-02 |
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory |
Nan Chen et al. |
2507.01945v1 |
2025-07-02 |
null |
2025-07-02 |
3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP |
Ranjan Sapkota et al. |
2507.01912v1 |
2025-07-02 |
null |
2025-07-02 |
An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram |
Sunder Neelakantan et al. |
2507.01867v1 |
2025-07-02 |
null |
2025-07-02 |
On the Visibility Polynomial of Graphs |
Tonny K B et al. |
2507.01851v1 |
2025-07-02 |
null |
2025-07-02 |
Out-of-Distribution Detection Methods Answer the Wrong Questions |
Yucen Lily Li et al. |
2507.01831v1 |
2025-07-02 |
null |
2025-07-02 |
Autoadaptive Medical Segment Anything Model |
Tyler Ward et al. |
2507.01828v1 |
2025-07-02 |
null |
2025-07-02 |
MILP-SAT-GNN: Yet Another Neural SAT Solver |
Franco Alberto Cardillo et al. |
2507.01825v1 |
2025-07-02 |
null |
2025-07-02 |
Boosting Adversarial Transferability Against Defenses via Multi-Scale Transformation |
Zihong Guo et al. |
2507.01791v1 |
2025-07-02 |
null |
2025-07-02 |
Are Vision Transformer Representations Semantically Meaningful? A Case Study in Medical Imaging |
Montasir Shams et al. |
2507.01788v1 |
2025-07-02 |
null |
2025-07-02 |
A Deterministic Partition Tree and Applications |
Haitao Wang et al. |
2507.01775v1 |
2025-07-02 |
null |
2025-07-02 |
Scheduling on identical machines with conflicts to minimize the mean flow time |
Nour ElHouda Tellache et al. |
2507.01759v1 |
2025-07-02 |
null |
2025-07-02 |
Calibrated Self-supervised Vision Transformers Improve Intracranial Arterial Calcification Segmentation from Clinical CT Head Scans |
Benjamin Jin et al. |
2507.01744v1 |
2025-07-02 |
null |
2025-07-02 |
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy |
Ming Dai et al. |
2507.01738v1 |
2025-07-02 |
null |
2025-07-02 |
Soft Self-labeling and Potts Relaxations for Weakly-Supervised Segmentation |
Zhongwen Zhang et al. |
2507.01721v1 |
2025-07-02 |
null |
2025-07-02 |
Component Adaptive Clustering for Generalized Category Discovery |
Mingfu Yan et al. |
2507.01711v1 |
2025-07-02 |
null |
2025-07-02 |
Towards Better Attribute Inference Vulnerability Measures |
Paul Francis et al. |
2507.01710v1 |
2025-07-02 |
null |
2025-07-02 |
Entropic optimal transport beyond product reference couplings: the Gaussian case on Euclidean space |
Paul Freulon et al. |
2507.01709v1 |
2025-07-02 |
null |
2025-07-02 |
Simulating Quantum State Transfer between Distributed Devices using Noisy Interconnects |
Marvin Bechtold et al. |
2507.01683v1 |
2025-07-02 |
null |
2025-07-02 |
Customized Exploration of Landscape Features Driving Multi-Objective Combinatorial Optimization Performance |
Ana Nikolikj et al. |
2507.01638v1 |
2025-07-02 |
null |
2025-07-02 |
Tile and Slide : A New Framework for Scaling NeRF from Local to Global 3D Earth Observation |
Camille Billouard et al. |
2507.01631v1 |
2025-07-02 |
null |
2025-07-02 |
Prompt Guidance and Human Proximal Perception for HOT Prediction with Regional Joint Loss |
Yuxiao Wang et al. |
2507.01630v1 |
2025-07-02 |
null |
2025-07-02 |
Adaptive Estimation of the Number of Algorithm Runs in Stochastic Optimization |
Tome Eftimov et al. |
2507.01629v1 |
2025-07-02 |
null |
2025-07-02 |
Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems |
Zhaoyan Sun et al. |
2507.01599v1 |
2025-07-02 |
null |
2025-07-02 |
A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation |
Hao Wang et al. |
2507.01573v1 |
2025-07-02 |
null |
2025-07-02 |
Efficient Out-of-Scope Detection in Dialogue Systems via Uncertainty-Driven LLM Routing |
Álvaro Zaera et al. |
2507.01541v1 |
2025-07-02 |
null |
2025-07-02 |
Chargax: A JAX Accelerated EV Charging Simulator |
Koen Ponse et al. |
2507.01522v1 |
2025-07-02 |
null |
2025-07-02 |
Mamba Guided Boundary Prior Matters: A New Perspective for Generalized Polyp Segmentation |
Tapas K. Dutta et al. |
2507.01509v1 |
2025-07-02 |
null |
2025-07-02 |
Integrating Traditional and Deep Learning Methods to Detect Tree Crowns in Satellite Images |
Ozan Durgut et al. |
2507.01502v1 |
2025-07-02 |
null |