2025-07-02 | AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation | Sixiang Chen et.al. | 2507.01961v2 | 2025-07-03 | null
2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955v1 | 2025-07-02 | null
2025-07-02 | 3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP | Ranjan Sapkota et.al. | 2507.01912v1 | 2025-07-02 | null
2025-07-02 | On the influence of reference sample properties on magnetic force microscopy calibrations | Baha Sakar et.al. | 2507.01911v1 | 2025-07-02 | null
2025-07-02 | Modality-agnostic, patient-specific digital twins modeling temporally varying digestive motion | Jorge Tapias Gomez et.al. | 2507.01909v2 | 2025-07-03 | null
2025-07-02 | Spacetime reconstruction by order and number | Mathias Braun et.al. | 2507.01907v1 | 2025-07-02 | null
2025-07-02 | A computationally frugal open-source foundation model for thoracic disease detection in lung cancer screening programs | Niccolò McConnell et.al. | 2507.01881v1 | 2025-07-02 | null
2025-07-02 | Direct Vertex Reconstruction of $Λ$ Baryons from Hits in CLAS12 using Graph Neural Networks | Keegan Menkce et.al. | 2507.01868v1 | 2025-07-02 | null
2025-07-02 | An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram | Sunder Neelakantan et.al. | 2507.01867v1 | 2025-07-02 | null
2025-07-02 | Modulate and Reconstruct: Learning Hyperspectral Imaging from Misaligned Smartphone Views | Daniil Reutsky et.al. | 2507.01835v1 | 2025-07-02 | null
2025-07-02 | The star HIP 41378 potentially misaligned with its cohort of long-period planets | S. Grouffal et.al. | 2507.01807v1 | 2025-07-02 | null
2025-07-02 | HCNQA: Enhancing 3D VQA with Hierarchical Concentration Narrowing Supervision | Shengli Zhou et.al. | 2507.01800v1 | 2025-07-02 | null
2025-07-02 | Femtosecond signatures of optically induced magnons before ultrafast demagnetization | Reza Rouzegar et.al. | 2507.01796v1 | 2025-07-02 | null
2025-07-02 | The inverse source problem of stochastic wave equation | Yunqing Huang et.al. | 2507.01789v1 | 2025-07-02 | null
2025-07-02 | Global Energy Minimization for Simplex Mesh Optimization: A Radius Ratio Approach to Sliver Elimination | Dong Wang et.al. | 2507.01762v1 | 2025-07-02 | null
2025-07-02 | Microscale architected materials for elastic wave guiding: Fabrication and dynamic characterization across length and time scales | Vignesh Kannan et.al. | 2507.01757v1 | 2025-07-02 | null
2025-07-02 | Full Stokes magnetometry of the active M dwarfs AU Mic and EV Lac with SPIRou | J. -F. Donati et.al. | 2507.01754v1 | 2025-07-02 | null
2025-07-02 | Black hole optical analogue: photon sphere microlasers | Chenni Xu et.al. | 2507.01751v1 | 2025-07-02 | null
2025-07-02 | Calibrated Self-supervised Vision Transformers Improve Intracranial Arterial Calcification Segmentation from Clinical CT Head Scans | Benjamin Jin et.al. | 2507.01744v1 | 2025-07-02 | null
2025-07-02 | HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion | Lin Wu et.al. | 2507.01737v2 | 2025-07-03 | null
2025-07-02 | SE(3)-Equivariant Diffusion Policy in Spherical Fourier Space | Xupeng Zhu et.al. | 2507.01723v1 | 2025-07-02 | null
2025-07-02 | Laser cooling and qubit measurements on a forbidden transition in neutral Cs atoms | J. Scott et.al. | 2507.01720v1 | 2025-07-02 | null
2025-07-02 | GPT, But Backwards: Exactly Inverting Language Model Outputs | Adrians Skapars et.al. | 2507.01693v1 | 2025-07-02 | null
2025-07-02 | Facial Emotion Learning with Text-Guided Multiview Fusion via Vision-Language Model for 3D/4D Facial Expression Recognition | Muzammil Behzad et.al. | 2507.01673v1 | 2025-07-02 | null
2025-07-02 | Tile and Slide : A New Framework for Scaling NeRF from Local to Global 3D Earth Observation | Camille Billouard et.al. | 2507.01631v1 | 2025-07-02 | null
2025-07-02 | Prompt Guidance and Human Proximal Perception for HOT Prediction with Regional Joint Loss | Yuxiao Wang et.al. | 2507.01630v1 | 2025-07-02 | null
2025-07-02 | QHARMA-GAN: Quasi-Harmonic Neural Vocoder based on Autoregressive Moving Average Model | Shaowen Chen et.al. | 2507.01611v1 | 2025-07-02 | null
2025-07-02 | Phases of Tree-decorated Dynamical Triangulations in 3D | Timothy Budd et.al. | 2507.01604v1 | 2025-07-02 | null
2025-07-02 | DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation | Yue-Jiang Dong et.al. | 2507.01603v1 | 2025-07-02 | null
2025-07-02 | Enhancing Multi-Exposure High Dynamic Range Imaging with Overlapped Codebook for Improved Representation Learning | Keuntek Lee et.al. | 2507.01588v1 | 2025-07-02 | null